
[docs] kernels#13139

Merged
stevhliu merged 2 commits into huggingface:main from stevhliu:kernels
Mar 25, 2026

Conversation

@stevhliu
Member

adds a kernels section to the Accelerate inference docs, along with the benchmark results:

  • cross-linked to the Attention backends docs, which demonstrate support for loading attention kernels with set_attention_backend
  • defer to the blog post and pipeline integration guide for details on implementing non-attention kernels, since that is more involved and already well-documented there

@stevhliu stevhliu requested a review from sayakpaul February 13, 2026 17:07
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@sayakpaul sayakpaul left a comment


Thanks a lot for prioritizing it.


[Kernels](https://huggingface.co/docs/kernels/index) is a library for building, distributing, and loading optimized compute kernels on the [Hub](https://huggingface.co/kernels-community). It supports [attention](./attention_backends#set_attention_backend) kernels and custom CUDA kernels for operations like RMSNorm.

The [Diffusers Pipeline Integration](https://github.com/huggingface/kernels/blob/main/skills/cuda-kernels/references/diffusers-integration.md) guide shows how to integrate a kernel. Create a custom optimized attention processor, patch all modules in the model, and inject the kernel into the pipeline.
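The "patch all modules" step above can be sketched in plain Python. This is an illustrative stand-in for the pattern (walk the model's submodules and swap targeted ones for kernel-backed replacements), not the actual Diffusers or kernels API; all class and function names here are hypothetical.

```python
# Illustrative sketch of the module-patching pattern from the integration
# guide. Plain-Python stand-ins are used instead of torch.nn modules;
# every name below is hypothetical, not part of Diffusers or kernels.

class RMSNorm:
    """Reference module we want to replace."""
    def __init__(self, name):
        self.name = name

class KernelRMSNorm(RMSNorm):
    """Stand-in for a kernel-backed drop-in replacement."""
    pass

class Model:
    def __init__(self):
        # flat registry of named submodules, analogous to
        # torch.nn.Module.named_modules()
        self.modules = {
            "blocks.0.norm": RMSNorm("blocks.0.norm"),
            "blocks.0.attn": object(),
            "blocks.1.norm": RMSNorm("blocks.1.norm"),
        }

def patch_rmsnorm(model):
    """Swap every plain RMSNorm submodule for the kernel-backed version."""
    patched = 0
    for name, module in model.modules.items():
        if type(module) is RMSNorm:  # skip already-patched subclasses
            model.modules[name] = KernelRMSNorm(name)
            patched += 1
    return patched

model = Model()
print(patch_rmsnorm(model))  # → 2
```

The same traversal-and-swap shape applies to an optimized attention processor: locate the attention modules, replace their processor, and the pipeline picks up the kernel on the next forward pass.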
Member


The kernel skill basically lets users get an agent to write custom kernels for a model and hardware. It's not specific to the attention processor; it also covers other modules, such as RMSNorm. Should we make that clearer?

Member Author


lmk if this is clearer!

> [!TIP]
> Install the [add cuda-kernels](https://github.com/huggingface/kernels/blob/main/skills/cuda-kernels/SKILL.md) skill to teach Claude or Codex how to write a kernel. The [Custom kernels for all from Codex and Claude](https://huggingface.co/blog/custom-cuda-kernels-agent-skills) blog post covers this in more detail.

For example, a custom RMSNorm kernel with [torch.compile](#torchcompile) speeds up LTX-Video generation 1.43x on an H100.
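For reference, the operation that the custom kernel accelerates, RMSNorm, computes y_i = x_i * w_i / sqrt(mean(x^2) + eps). A plain-Python reference sketch (not the CUDA kernel itself, just the math it implements):

```python
import math

def rms_norm(x, weight, eps=1e-6):
    """Reference RMSNorm over a 1-D vector:
    y_i = x_i / sqrt(mean(x^2) + eps) * w_i
    """
    mean_sq = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [v * scale * w for v, w in zip(x, weight)]

out = rms_norm([3.0, 4.0], [1.0, 1.0])
# mean(x^2) = 12.5, so out ≈ [0.8485, 1.1314]
```

A fused CUDA kernel performs the same reduction and scaling in one pass, which is where the speedup over an eager per-op implementation comes from.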
Member


It wasn't just RMSNorm; other modules were implemented with custom kernels as well.

Member Author


i mention RMSNorm as an example only for the benchmark results below

Member

@sayakpaul sayakpaul left a comment


Thanks! I would like to also see what @burtenshaw thinks about this.

@stevhliu stevhliu merged commit cbf4d9a into huggingface:main Mar 25, 2026
2 checks passed
@stevhliu stevhliu deleted the kernels branch March 25, 2026 16:32
